Progress in learning 3 vs. 2 keepaway

نویسندگان

Gregory Kuhlmann

Peter Stone

چکیده

Reinforcement learning has been successfully applied to several subtasks in the RoboCup simulated soccer domain. Keepaway is one such task. One notable success in the keepaway domain has been the application of SMDP Sarsa(λ) with tile-coding function approximation [9]. However, this success was achieved with the help of some significant task simplifications, including the delivery of complete, noise-free world-state information to the agents. Here we demonstrate that this task simplification was unnecessary and further extend the previous empirical results on this task.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Evolving Static Representations for Task Transfer

An important goal for machine learning is to transfer knowledge between tasks. For example, learning to play RoboCup Keepaway should contribute to learning the full game of RoboCup soccer. Previous approaches to transfer in Keepaway have focused on transforming the original representation to fit the new task. In contrast, this paper explores the idea that transfer is most effective if the repre...

متن کامل

Genotypic versus Behavioural Diversity for Teams of Programs under the 4-v-3 Keepaway Soccer Task

Keepaway soccer is a challenging robot control task that has been widely used as a benchmark for evaluating multi-agent learning systems. The majority of research in this domain has been from the perspective of reinforcement learning (function approximation) and neuroevolution. One of the challenges under multi-agent tasks such as keepaway is to formulate effective mechanisms for diversity main...

متن کامل

On Diversity, Teaming, and Hierarchical Policies: Observations from the Keepaway Soccer Task

The 3-versus-2 Keepaway soccer task represents a widely used benchmark appropriate for evaluating approaches to reinforcement learning, multi-agent systems, and evolutionary robotics. To date most research on this task has been described in terms of developments to reinforcement learning with function approximation or frameworks for neuro-evolution. This work performs an initial study using a r...

متن کامل

Learning Through Interaction

Reinforcement learning is an approach for learning optimal action policy via experiencing, i.e. using observed reward in environment states. Reinforcement learning algorithms include adaptive dynamic programming, temporal difference learning and Q-learning[1]. Examples of successful applications of reinforcement learning are controller for sustained inverted flight on an autonomous helicopter [...

متن کامل

Concurrent Hierarchical Reinforcement Learning for RoboCup Keepaway

RoboCup Keepaway, originated from the RoboCup soccer simulation 2D challenge, has been widely used as a machine learning benchmark. In this paper, we present a concurrent hierarchical reinforcement learning approach to RoboCup Keepaway. Following the idea of hierarchies of abstract machines (HAMs), we write a partial policy as a HAM from the perspective of a single keeper, run multiple instance...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2003

Progress in learning 3 vs. 2 keepaway

نویسندگان

چکیده

منابع مشابه

Evolving Static Representations for Task Transfer

Genotypic versus Behavioural Diversity for Teams of Programs under the 4-v-3 Keepaway Soccer Task

On Diversity, Teaming, and Hierarchical Policies: Observations from the Keepaway Soccer Task

Learning Through Interaction

Concurrent Hierarchical Reinforcement Learning for RoboCup Keepaway

عنوان ژورنال:

اشتراک گذاری